Picture for Xinyu Chen

Xinyu Chen

TAPS: Target-Aware Prefix Tree Selection for Diffusion-Drafted Speculative Decoding

Add code
May 30, 2026
Viaarxiv icon

SimInsert: Seamless Video Object Insertion via Regional Sparse Attention Fusion

Add code
May 22, 2026
Viaarxiv icon

DEFLECT: Delay-Robust Execution via Flow-matching Likelihood-Estimated Counterfactual Tuning for VLA Policies

Add code
May 19, 2026
Viaarxiv icon

EMA: Efficient Model Adaptation for Learning-based Systems

Add code
May 13, 2026
Viaarxiv icon

FashionStylist: An Expert Knowledge-enhanced Multimodal Dataset for Fashion Understanding

Add code
Apr 13, 2026
Viaarxiv icon

A Layer-wise Analysis of Supervised Fine-Tuning

Add code
Apr 12, 2026
Viaarxiv icon

INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling

Add code
Apr 08, 2026
Viaarxiv icon

MSVBench: Towards Human-Level Evaluation of Multi-Shot Video Generation

Add code
Feb 27, 2026
Viaarxiv icon

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Add code
Nov 16, 2025
Viaarxiv icon

Medical Referring Image Segmentation via Next-Token Mask Prediction

Add code
Nov 07, 2025
Viaarxiv icon